Using Dia-mole for Unsupervised Learning of Domain-specific Dialogue Acts from Spontaneous Language

نویسنده

Jens-Uwe Möller

چکیده

This report introduces DIA-MOLE, a tool that supports an engineering-oriented approach towards dialogue modelling for a spoken-language interface. Our approach is applied to the domain of appointment scheduling. A major step towards dialogue models is to know about the basic units that are used to construct a dialogue model. DIA-MOLE does not employ theory-based dialogue units because they are subject to human interpretation and often cannot be recognized from data available in a spoken-language system. We pursue a data-driven approach and apply unsupervised learning to a sample set of spontaneous dialogues using multiple knowledge sources, i.e. domain and task knowledge, word recognition and prosodic information. Using these data, DIA-MOLE supports segmentation of turns and interpretation of their illocutionary force based on a model of the task. For this purpose we had to develop a model of interactive problem solving in the domain of appointment scheduling. As a result of learning we obtain domainand task-specific dialogue acts (DDA). A first validation of the set of learned DDAs shows that they are prominent for this domain and task. Some DDAs show significant correspondence to specific nodes in an RST-structure. Automatic DDA labeling was compared with human dialogue act labeling according to a predefined labeling scheme. Dialogue act prediction was also employed to evaluate our approach. Kurzfassung. Das Werkzeug DiaMoLE unterstützt einen ingenieursmäßigen Ansatz zur Dialogmodellierung für eine gesprochen-sprachliche Mensch-Maschine-Schnittstelle. Der vorgestellte Ansatz findet Anwendung in der Domäne Terminvereinbarung. Ein bedeutender Schritt auf dem Weg zu einem Dialogmodell ist die Kenntnis der zugrundeliegenden Einheiten, aus denen es sich zusammensetzt. In Dia-MoLE finden keine theoriebasierten Dialogeinheiten Anwendung, die menschliche Interpretation voraussetzen und die auf der Basis vorhandener Daten in einem gesprochen-sprachlichen System nur eingeschränkt erkannt werden können. Unter Verwendung verschiedener Wissensquellen, wie Domänenund Aufgabenwissen, Worterkennung und Prosodie wird anstattdessen ein datengetriebener Ansatz verfolgt. Ausgehend von diesen Daten wird in Dia-MoLE die Segmentierung von Äußerungen und die Interpretation der Illokution basierend auf einem Modell der Aufgabe vorgenommen. Hierzu mußte für die Domäne Terminvereinbarung ein Modell interaktiver Problemlösung entwickelt werden. Anschließend wird ein unüberwachtes Lernverfahren auf eine Reihe derartig vorverarbeiteter spontansprachlicher Dialoge angewendet. Als Ergebnis des Lernverfahrens werden domänenund aufgabenspezifische Dialogakte (DDA) gebildet. Weitere Dialogbeiträge können aufgrund der DDA klassifiziert werden. Erste Analysen bestätigen, daß die Menge der gelernten DDAs charakteristisch für die Domäne und Aufgabe sind. So zeigen einige DDAs signifikante Übereinstimmung mit bestimmten Knoten in einer RST-Struktur. Automatisches Labeling mit DDA wird verglichen mit human-gelabelten Dialogakten. Darüberhinaus wurde eine Dialogaktvorhersage implementiert, um die Qualität der Ergebnisse dieses Ansatzes auch in dieser Hinsicht zu evaluieren.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Dia-moLE: an unsupervised learning approach to adaptive dialogue models for spoken dialogue systems

ion to 9 classes 32.91%54.51% Figure 5: Hit rate for dialogue act predictions

متن کامل

CLASSITALL: Incremental and Unsupervised Learning in the DIA-MOLE Framework

The learning algorithm CLASSITALL is a module of DIA-MOLE, a tool that supports an engineeringoriented approach towards dialogue modelling for a spoken-language interface. CLASSITALL is a descendant of the COBWEB conceptual clustering algorithm, and in this paper we especially focus on extensions that are necessary for processing data within a spoken language system environment. While most lear...

متن کامل

Unsupervised Classification of Student Dialogue Acts with Query-Likelihood Clustering

Dialogue acts model the intent underlying dialogue moves. In natural language tutorial dialogue, student dialogue moves hold important information about knowledge and goals, and are therefore an integral part of providing adaptive tutoring. Automatically classifying these dialogue acts is a challenging task, traditionally addressed with supervised classification techniques requiring substantial...

متن کامل

Deep Unsupervised Domain Adaptation for Image Classification via Low Rank Representation Learning

Domain adaptation is a powerful technique given a wide amount of labeled data from similar attributes in different domains. In real-world applications, there is a huge number of data but almost more of them are unlabeled. It is effective in image classification where it is expensive and time-consuming to obtain adequate label data. We propose a novel method named DALRRL, which consists of deep ...

متن کامل

Understanding Student Language: An Unsupervised Dialogue Act Classification Approach

Within the landscape of educational data, textual natural language is an increasingly vast source of learning-centered interactions. In natural language dialogue, student contributions hold important information about knowledge and goals. Automatically modeling the dialogue act of these student utterances is crucial for scaling natural language understanding of educational dialogues. Automatic ...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2006

Using Dia-mole for Unsupervised Learning of Domain-specific Dialogue Acts from Spontaneous Language

نویسنده

چکیده

منابع مشابه

Dia-moLE: an unsupervised learning approach to adaptive dialogue models for spoken dialogue systems

CLASSITALL: Incremental and Unsupervised Learning in the DIA-MOLE Framework

Unsupervised Classification of Student Dialogue Acts with Query-Likelihood Clustering

Deep Unsupervised Domain Adaptation for Image Classification via Low Rank Representation Learning

Understanding Student Language: An Unsupervised Dialogue Act Classification Approach

عنوان ژورنال:

اشتراک گذاری